Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs

Author

Rolando Cavazos-Cadena

Abstract

Average cost Markov decision chains with discrete time parameter are considered. The cost function is unbounded and satisfies an additional condition which frequently holds in applications. Also, we assume that there exists a single stationary policy for which the corresponding Markov chain is irreducible and ergodic with finite average cost. Within this framework, the existence of an average cost optimal stationary policy is proved.
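For orientation, the average cost criterion referred to in the abstract is commonly formalized as below. This is a sketch of the standard definition only; the symbols (X_t, A_t, C, J) are assumed for illustration and are not taken from the paper, which may use different notation or a different variant of the limit.

```latex
% Notation assumed for illustration (not from the paper):
%   X_t = state at time t, A_t = action at time t, C = one-step cost,
%   E_x^\pi = expectation under policy \pi when the initial state is x.
J(\pi, x) \;=\; \limsup_{n \to \infty} \frac{1}{n+1}\,
  E_x^{\pi}\!\left[ \sum_{t=0}^{n} C(X_t, A_t) \right],
\qquad
J^{*}(x) \;=\; \inf_{\pi} J(\pi, x).
```

Under this convention, a policy \pi^{*} is average cost optimal if J(\pi^{*}, x) = J^{*}(x) for every state x, and the result announced in the abstract is that such a policy can be found among the stationary ones.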

Similar resources

Time and Ratio Expected Average Cost Optimality for Semi-Markov Control Processes on Borel Spaces

We deal with semi-Markov control models with Borel state and control spaces, and unbounded cost functions under the ratio and the time expected average cost criteria. Under suitable growth conditions on the costs and the mean holding times together with stability conditions on the embedded Markov chains, we show the following facts: (i) the ratio and the time average costs coincide in the class...

Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

This paper presents sufficient conditions for the existence of stationary optimal policies for average-cost Markov Decision Processes with Borel state and action sets and with weakly continuous transition probabilities. The one-step cost functions may be unbounded, and action sets may be noncompact. The main contributions of this paper are: (i) general sufficient conditions for the existence of ...

On the optimality equation for average cost Markov decision processes and its validity for inventory control

As is well known, average-cost optimality inequalities imply the existence of stationary optimal policies for Markov decision processes with average costs per unit time, and these inequalities hold under broad natural conditions. This paper provides sufficient conditions for the validity of the average-cost optimality equation for an infinite state problem with weakly continuous transition prob...

Drift and monotonicity conditions for continuous-time controlled Markov chains with an average criterion

In this paper, we give conditions for the existence of average optimal policies for continuous-time controlled Markov chains with a denumerable state–space and Borel action sets. The transition rates are allowed to be unbounded, and the reward/cost rates may have neither upper nor lower bounds. In the spirit of the “drift and monotonicity” conditions for continuous-time Markov processes, we pro...

A Semimartingale Characterization of Average Optimal Stationary Policies for Markov Decision Processes

This paper deals with discrete-time Markov decision processes with Borel state and action spaces. The criterion to be minimized is the average expected costs, and the costs may have neither upper nor lower bounds. In our former paper (to appear in Journal of Applied Probability), weaker conditions are proposed to ensure the existence of average optimal stationary policies. In this paper, we fur...

Journal:

Kybernetika

Volume 25, Issue

Pages -

Publication date: 1989
